Corpus: roh-ch_web_2017

2.2.5 Most frequent word beginnings

The most frequent word beginnings, counted as character N-grams at the start of a word for N = 1 to 5, together with a Zipf diagram.
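A minimal sketch of how such word-beginning counts could be produced. This is not the corpus tool's actual procedure: the function name `prefix_counts`, the choice to skip words shorter than N, and the sample word list are all assumptions made for illustration.

```python
from collections import Counter

def prefix_counts(words, max_n=5):
    """Count the first n characters of each word for n = 1..max_n.

    Assumption: a word shorter than n contributes nothing to the n-gram
    table. The source does not say how short words are handled.
    """
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for word in words:
        for n in range(1, max_n + 1):
            if len(word) >= n:
                # The trailing hyphen mirrors the notation used in the tables.
                counts[n][word[:n] + "-"] += 1
    return counts

# Hypothetical sample input, not taken from the corpus.
words = ["contar", "construir", "cumprar", "cuntent", "pro"]
counts = prefix_counts(words)

for n in range(1, 6):
    # most_common() sorts by descending count, which gives the rank order.
    print(n, counts[n].most_common(3))
```

Whether the published frequencies count running word tokens or distinct word forms is not stated in the source; the sketch counts every item in the input list once.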


Figure: Zipf's diagram for word beginnings (Gnuplot diagram)

Top Characters
Rank  Frequency  N-gram
1     14927      s-
2     11289      c-
3     9694       d-
4     9273       p-
5     7923       «-

Top Character Bigrams
Rank  Frequency  N-gram
1     3352       re-
2     3132       in-
3     3044       co-
4     2843       cu-
5     2787       pr-

Top Character Trigrams
Rank  Frequency  N-gram
1     1594       con-
2     1194       pro-
3     1098       cha-
4     1078       pre-
5     1065       sch-

Top Character 4-Grams
Rank  Frequency  N-gram
1     755        tsch-
2     554        cump-
3     510        inte-
4     489        cunt-
5     462        cons-

Top Character 5-Grams
Rank  Frequency  N-gram
1     389        inter-
2     270        tsche-
3     265        trans-
4     205        cunte-
5     180        tschi-
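
The rank and frequency pairs listed above are the kind of points a Zipf diagram plots on log-log axes, where an ideal Zipf distribution (frequency proportional to 1/rank) appears as a straight line with slope -1. The following sketch fits that slope by ordinary least squares. The helper name `loglog_slope` is hypothetical, and five points are far too few for a reliable estimate, so the result is only illustrative.

```python
import math

def loglog_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank).

    Ranks are assumed to start at 1 and follow the order of the list.
    Needs at least two frequencies.
    """
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(freq) for freq in frequencies]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Frequencies copied from the "Top Characters" table, ranks 1 to 5.
top_characters = [14927, 11289, 9694, 9273, 7923]
slope = loglog_slope(top_characters)
print(f"fitted log-log slope: {slope:.3f}")
```

A slope close to -1 would match the classic Zipf shape; a much flatter slope over the top ranks indicates that the leading word beginnings are closer to each other in frequency than the ideal law predicts.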
2007 msec needed at 2018-06-04 16:01